Microsoft Launches Agent Lightning: A New AI Framework to Help Train Large Language Models with Reinforcement Learning
Microsoft launches the open-source framework Agent Lightning, which uses reinforcement learning to optimize multi-agent systems. The framework does not require changes to existing architectures and can convert real agent behaviors into reinforcement learning transitions, improving the performance of strategies in large-scale language models. It models agents as partially observable Markov decision processes, using the current input as an observation, model calls as actions, and introducing a reward mechanism.